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Abstract — In this paper, for the purposes of information 
transmission and network error correction simultaneously, three 
classes of important linear network codes in network coding, 
linear multicast/broadcast/dispersion codes are generalized to 
linear network error correction coding, i.e., linear network 
error correction multicast/broadcast/dispersion codes. We further 
propose the (weakly, strongly) extended Singleton bounds for 
these new classes of codes, and define the optimal codes satisfying 
the corresponding Singleton bounds with equality, which are 
called multicast/broadcast/dispersion MDS codes respectively. 
The existence of such codes are proved by an algebraic method 
and one kind of constructive algorithm is also proposed. 

I. Introduction 

In network communication, the source node can multicast 
the information to all sink nodes at higher rate, if network 
coding is applied in a network, rather than routing alone, JT]- 
|6|. When all kinds of errors may occur in network communi- 
cation, network error correction coding (NEC) was discussed 
widely in order to deal with such problems, |]7]- ifTD . In this 
paper, three types of linear network error correction codes, 
linear network error correction multicast/broadcast/dispersion 
codes, are introduced and studied, which can be regarded as 
the generalization of three types of important linear network 
codes, linear network multicast/broadcast/dispersion codes. 

A communication network is represented by a finite acyclic 
directed graph G = (V,E), where V and E are the sets of 
nodes and channels of the network, respectively. A direct edge 
e = G E stands for a channel leading from node i to 

node j. Node i is called the tail of e and node j is called 
the head of e, denoted by tail(e) and head(e), respectively. 
Correspondingly, the channel e is called an outgoing channel 
of i and an incoming channel of j. For a node i, define 
Out(i) = {e G E : tail(e) = i}, and In(i) — {e G E : 
head(e) — i}. We allow the multiple channels between two 
nodes and assume that one field symbol can be transmitted 
over a channel in a unit time. In this paper, we only consider 
single source networks, and the unique source node is denoted 
by s. Let T be a collection of non-source nodes. A cut between 
the source node s and T is a set of channels whose removal 
disconnects s from all t G T. For unit capacity channels, the 
capacity of a cut between s and T can also be regarded as 
the number of channels in the cut, and the minimum of all 
capacities of cuts between s and T is called the minimum cut 



capacity Ct between the source node s and T. A cut between 
s and T of the minimum cut capacity is called a minimum 
cut. When the collection T just consists of one node i, the 
above concepts degenerate into the general case for node t. In 
particular, the minimum cut capacity between the source node 
s and non-source node t is denoted as Ct- In fact, if we expand 
the single source network G = (V 1 E) into G\ = (V\ 1 E\) by 
installing a new node tx which is connected from every node 
t G T by |/n(t)| multiple unit capacity channels, then the 
minimum cut capacity between the source node s and T in 
G is equal to the minimum cut capacity between the source 
node s and the node ty in G\. 

Let the information rate be uj symbols per unit time. 
Then the source node has ui imaginary incoming channels 
In(s) — {di, d' 2) ■ ■ • jd'^}. The source messages are to sym- 
bols X = (Xi, X2, • • • , X u ) G J 7 " arranged in a row vector. 
When an error occurs on channel e, the output of the channel 
is U e — U e + Z e , where U e is the message that should 
be transmitted over the channel e and Z e G T is the error 
occurred in e. In network G — (V, E), corresponding to each 
channel e G E, an imaginary channel e' is introduced, which is 
connected to the tail of e in order to provide error message Z e . 
The network G — (V, E) with imaginary channels is called 
the extended network of G, where V = V, E = EUE'Uln(s) 
with E' = {e 1 : e G E}. Obviously, \E'\ = \E\. Then 
a linear network code on the original network G can be 
amended to a linear network code on the extended network 
G by setting local encoding coefficients k e > , e = 1, k e i ,d = 
for all d G E\{e}, and others kd, e G J- remains unchanged. 
Note that, for each non-source node i in the extended network, 
In(i) only includes the real incoming channels of i, that 
is, the imaginary channels e' corresponding to e G Out(i) 
are not in In(i). But for the source node s, we still define 
In(s) = {d[, d' 2 , ■ ■ ■ , d'^}. We can also define extended global 
encoding kernels f e for all e G E. The extended global 
encoding kernel f e for e G E is an (cj + \E\) -dimensional 
column vector and the entries can be indexed by the elements 
of In(s)UE. For imaginary message channels d[ L (1 < i < cu) 
and imaginary error channels e' G E', let — 1^, f e ' = le, 
where 1^ is an (oj + \E\) -dimensional column vector which 
is the indicator function of d G In(s) U E. And for other 
global encoding kernels / e , e G E, we have recursive formulae 



fe — J2deln(tail{e)) ^d.efd + le- 

Let Z = (Z e : e G E) be an \E\ -dimensional row vector 
with Z e G J- for all e G E, and Z is called the error message 
vector. An error pattern p is regarded as a set of channels in 
which errors occur and we call that an error message vector 
Z matches an error pattern p, if Z e = for all e G E\p. 

Definition 1: For an error pattern p and an extended global 
encoding kernel f e , e G E, define: 

• fP an (uj + \p |) -dimensional column vector obtained from 
fe = (fe(d) : d G In(s) U E) by removing all entries 
f e (d),d(£lri(s)Up. 

• f£ an (w+ 1 |) -dimensional column vector obtained from 
fe = (fe{d) : d G In(s) U -E) by replacing all entries 
f e (d), d i In(s) U p by 0, and /f 4 f e - /p. 

II. Linear Network Error Correction 
Multicast/Broadcast/Dispersion Codes 

We introduce several new concepts. Some of which are 
regarded as the corresponding generalizations in original linear 
network error correction codes. 

Definition 2: For a linear network error correction code on 
network G, let T be a collection of non-source nodes and 
In(T) = UteTln(t). The decoding matrix Ft respect to T is 
defined as: 

Ft — {fe ■ e G In(T)) = (/ e : e G U teT In(t)) . 

For the collection T, the decoding matrix Ft and U e , e G 
In(T) are available. And then, we can use the following 
equation for decoding: 

(X Z)-F T = (U e : eeIn(T)). 

Denote by rowx(<i) the row vector of Ft indicated by the 
channel d G In{s) U E. Let L be a collection of vectors in 
a vector space. We adopt the convention that (L) represents 
the subspace spanned by vectors in L. At a collection T of 
non-source nodes, the following vector spaces are important. 

Definition 3: For a linear network error correction code on 
network G and a collection T of non-source nodes, define 

$(T,G) = <{row T «): i = 1, 2, • • • ,w}>, 
A(T,p,G) = ({row T (e) : e G p}), 

which is called the message space of T and the error space of 
error pattern p with respect to T, respectively. 

Definition 4: We say that an error pattern p\ is dominated 
by another error pattern p^ with respect to a collection T 
of non-source nodes, if A(T, p\) C A(T, P2) f° r an y linear 
network error correction code. This relation is denoted by 

Pi <T R2- 

Definition 5: The rank of an error pattern p with respect to 
a collection T of non-source nodes is defined as 

ranfcr(p) = min{|//| : p -<t p'}- 

In order to understand the above concept of rank of an error 
pattern, we give the following lemma. 



Lemma 1: Let G = {V, E) be an acyclic network, 
p = {ei,B2,--- ,ei} be an error pattern on G with ej G 
In(ij), (1 < j < I), and T be a collection of non-source 
nodes. Introduce a source node s p and define I channels 
e'j — (s p ,ij). Replace each ej by (1 < j < I) on the 
network G, that is, add e^e^, - - , e[ on the network and 
delete ei, e2, • • • , &i from the network. Then the rank of the 
error pattern p with respect to the collection T is equal to the 
minimum cut capacity between s p and T. 

The proof is similar to that of [|9] Lemma 1], and, there- 
fore, omitted. First, recall the concepts of linear multicast, 
linear broadcast, and linear dispersion, which have important 
applications in practice [2) © fl2l - 

Definition 6: An w-dimensional linear network code on an 
acyclic network G = (V, E) qualifies as a linear multicast, 
linear broadcast, and linear dispersion, respectively, if the 
following hold: 

1) dim(C(In(t))) = u) for every non-source node f e V 
with Ct > u>; 

2) dim(£(7n(i))) = min{cj,Ct} for every non-source 
node t eV; 

3) dim(£(/n(T))) = minl^Cr} for every collection T 
of non-source nodes; 

where C(B) = ({/ e : e G £>}) for any subset B C In(s)UE. 

Definition 7: For an w-dimensional linear network error 
correction code, 

1) it is called regular, if dim(<l>(i)) = u> for any non-source 
node t G V with Ct > uj. 

2) it is called strongly regular, if dim($(i)) = min{a;, Ct} 
for any non-source node t G V. 

3) it is called sup-regular, if dim(<i>(T)) = uj for any 
collection T of non-source nodes with Ct > 

4) it is called strongly sup-regular, if dim(<p(T)) = 
min{u;, Ct} for any collection T of non-source nodes. 

Further, a linear network error correction code is called 
multicast, broadcast and dispersion, if the corresponding reg- 
ular, strongly regular and strongly sup-regular properties are 
satisfied respectively. 

Definition 8: The minimum distance of a linear network 
error correction code on G at any collection T of non-source 
nodes is defined as 

dZi(G) = min{|p| : A(T,p) n *(T) + {0}} 

= min{ranfc T (p) : A(T, p) n $(T) ^ {0}} 
= min{dim(A(T, p) : A(T, p) n $(T) ^ {0}}. 

Further, if T is replaced by a non-source node t G V, it 
is called the minimum distance of a linear network error 
correction code on G at £ G V. 

Similar to original linear network error correction codes 
ifTOll ifTTIl . the above minimum distances fully characterize 
the error-detecting and error-correcting capabilities at the non- 
source node t G V and the collection T of non-source nodes, 
respectively. 

The same as in classical coding theory, some upper bounds 
on these minimum distances are of importance. The following 



upper bounds are similar to the Singleton bound in original 
linear network error correction codes [7 [9] [flOl . and thus we 
call them the extended Singleton bounds. 

Theorem 2 (Singleton Bounds): Consider a strongly sup- 
regular linear network error correction code on a network 
G = (V, E). Let d^2 n (G) be the minimum distance respect to 
a collection T of non-source nodes in V. Then 



Note that Z_ 2 Q because (U ei U e 



^ 0. Since 




if Ct > W) 
if Ct < oj. 



(1) 



This is called the extended Singleton bound. Similarly, replace 
the collection T by one non-source node t, the above bound 
is called the weakly extended Singleton bound. 

^ ( r T\ 

Remark 3: Actually, it is easily seen that d ■ (G) > 1. 
Thus, for the case Ct < oj, we obtain d^Jfir) = 1, 
which implies that there is no error-correcting capability. This 
is consistent with our intuition that the error correction is 
meaningless for the case Ct < oj, since none of source 
messages can be decoded at the collection T. 

Proof: Let T be any collection of non-source nodes 
in V. Since the considered linear network error correction 
code is strongly sup-regular, it follows that dim($(7 1 )) = 
minjw, Ct} from Definition Q To complete the proof, we 
discuss two cases below. 

Case 1: Ct > oj, and thus dim($(T)) = u. 

Let the set of channels {ei,e2,--- ,ec T } be an arbi- 
trary minimum cut between s and T with an upstream-to- 
downstream ancestral order ei -< e<z -< • • • -< ec T , and p = 
{euj, e w +i, • • ■ ,ec T } be an error pattern. As we will show, 
A(2»n$(T) ^ {0}, which implies that d^i <C t -oj + 1. 

Let X be a source message vector and Z_ be an error 
message vector. Then for each channel e £ E, we know 
( X Z) ■ f e = U e , where f e is the extended global encoding 
kernel of e and U e is the output of the channel e. Since the 
rank of the matrix (/ ei f e2 ■ ■ ■ fe u -i) * s at most (oj — 1), 
there exists a nonzero message vector X_ 1 and an all-zero error 
message vector Z_ x = such that 



GKi z x ) ■ (f ei f e . 



Ue a - 1 )=Q, 



Furthermore, since the linear network error correction code is 
strongly sup-regular, this further implies 



(2G z x ) ■ (f ei f e . 



f eo ) = (U ei U e . 



Assume the contrary that (U ei U e 
further implies At = (U e ■ e 



• Ue CT ) y^Q. 

■ ■■■ Uec T ) = 0, which 
£ In(T)) = 0, since 



{ei,e2,-- - ,ec T } is a minimum cut between s and T and 
Z_i = 0. Furthermore, we have (X_ 1 Q)Ft = from the 
decoding equation (X 1 Zl\)Et = At- Therefore, X_ x = 
from dim($(T)) = ui. This contradicts to our assumption 

On the other hand, there exists another source message 
vector X_ 2 = and another error message vector Z_ 2 matching 
the error pattern p = {e u , e^+i, • • • , ec T }, such that 



(2Ca Z 2 )-(f ei 



fe 



e w -< e u+ i -<■■■-< ec T , for any e £ p, we can set 
sequentially with the boundary condition Zd = for all 

d £ E\p: 

deln(tail(e)) 

where U' d is the output of channel d in this case. 
Combining the above, we have 

{X t 0)-F T =(Q Z 2 )-F t , 

which, together with the fact that Z_ 2 ^ matches the error 
pattern p, proves that 

<&(T) n A(7» ^ {0}. 

That is, d^ a (G) < C t -oj+1 for any collection of non-source 
nodes T with Ct > oj. 

Case 2: Ct < oj and thus dim($(T)) = Ct < oj. Similarly, 
apply the same method to a minimum cut {ex, ei, • • • , ec T } 
and an error pattern p = {ec T }- We can obtain d^ n (G) < 
1 for all collections T of non-source nodes with Ct < oj. 
Combining the two cases, the proof is completed. ■ 

Adopt the convention that the codes satisfying Singleton 
bound with equality are called maximum distance separable 
(MDS) codes. 

Definition 9: An w-dimensional linear network error cor- 
rection code over a network G is called linear network error 
correction multicast/broadcast/dispersion MDS code, respec- 
tively, or multicast/broadcast/dispersion MDS code for short , 
if the following hold: 

1) it is regular and d^ in (G) = Ct — oj + 1 for any non- 
source node t £ V with Ct > oj; 

2) it is strongly regular and the weakly extended Singleton 
bound is satisfied with equality for any non-source node 
t £ V; 

3) it is strongly sup-regular and the extended Singleton 
bound ([1) is satisfied with equality for any collection 
T of non-source nodes. 

III. The Existence of the Optimal Codes 

In this section, we show that multicast/broadcast/dispersion 
MDS codes defined above do exist which means that our 
proposed Singleton bounds are achievable. For a collection 
T of non-source nodes with Ct > oj and a non-source node 
t £ V with C t > oj, define: 



Rt(St, G) = { error pattern p 
Rt (St , G) = { error pattern p 



\p\ = rank T {p) = S T }, 
\p\ = rank t {p) = 6 t }, 

where St = Ct — oj + 1 and 5 t — Ct — oj + 1. When there 
is no ambiguity, Rt(St, G) and R t (S t ,G) will be abbreviated 
as Rt(5t) and R t (5 t ), respectively. 

Theorem 4: For a network G = (V, E), there exists an lj- 
dimensional linear network error correction multicast MDS 
code on G, if the size of the base field satisfies: 



1^1 > Yl \M*t)\ 



t£V:C t >u 



In order to prove this result, we need to prepare several 
lemmas below. 

Lemma 5 ( i lTTI Corollary 4]): For each t G V with Ct > 
oj and each error pattern p G Rt(5t), there exist (oj + 5 t ) 
channel disjoint paths from either In(s) — {d'^d^, ■■■ , c^} 
or p' = {e! : e G p] to t satisfying the following properties: 

1) there are exactly S t paths from p' to t, and oj paths from 
In(s) to t; 

2) these 5 t paths from p' to t start with the distinct channels 
in p' and for each path, if it starts with e' G p', then it 
passes through e e p. 

Lemma 6 ( fi3\ Lemma 1]): Let /(xi, X2, ■ • ■ ,x n ) be a 
nonzero polynomial with coefficients in a field J 7 . If |.F| is 
greater than the degree of / for any Xi (1 < i < n), then there 
exist oi, a2, • • ■ , a„ e J such that /(ai, 02, ■ • ■ , a„) ^ 0. 
Lemma 7 ( [9 Lemma 2]): A code is regular and has min- 



imum distance cf*j > d + 1 if 



rank t (p) > d 



Rank(F t p ) > oj + d. 



Proof of Theorem [?} Let t be an arbitrary non-source 
node in V with Ct > oj, and p be an arbitrary error pattern in 
Rt(St). It is not difficult to see that each entry of the decoding 
matrix F t (obviously, F p ) is a polynomial of local encoding 
coefficients kd. e for some channel adjacent pairs (d, e), d, e G 
In(s) U For the non-source node t and the error pattern 
p, there exist (oj + St) channel disjoint paths satisfying the 
following conditions from rankt(p) — St and Lemma [5] 1) 
there are exactly St paths from p' to t, and oj paths from 
In(s) to t; 2) these ^ paths from p 1 to i start with the distinct 
channels in p' and for each path, if it starts with e' G p', then 
it passes through e E p. Further, let In'(t) be the collection 
of the last channels of the uj + St paths. Clearly, In'[t) C 
In(t). For any subset 77 of £7, define a |7j| x matrix = 
(Ad,e)de v ,eeE satisfying: 



A d , e = 



1 d = e; 
otherwise. 



Recall the matrix ill = (/ e : e G E), where all extended 
global encoding kernels are put in juxtaposition according to 
the ancestral order, the matrix B = ( ^ ), where I denotes 
an \E\ x \E\ identity matrix and B = (/c<j, e )de/n( s ), e e£ is 
an uj x \E\ matrix with kd, e = for e ^ Out(s) and 
&d, e being the local encoding coefficient for e G Out(s), 
and the system transfer matrix K = (kd, e )d£E,eeE is an 
\E\ x matrix where k d . e is the local encoding coefficient 
for head(d) = tail(e) and k d:E = for head(d) 7^ tail(e). 
Then we have formula M = B{I — K) . This formula 
(similar to the Koetter-Medard Formula 0) first appears in 

0. Define i„ = f/" x " ^I^A a matrix of size 

V°Wx|J5| A p ) 
(uj + \p\) x (uj+ \E\) = (oj + S t ) x (uj + \E\), where QX b 
represents an a x b zero matrix. Thus, we have 



Ff±A p B(I-K)-iAl 



B(I-K)-iAj n(t) 
U-[A p (I-K)-iA} 



(Ft)in'(t) = A p B(I—K)^ 1 Aj n ,^ 



'B(I-K)-iA] nl{t) ' 
A p (I-K)-'Aj n , (t) 



is a submatrix of Ff and just consists of the column vectors of 
Ff corresponding to the channels in In'(t). Thus put kd, e = 1 
for all adjacent pairs of channels (d, e) along every one of the 
chosen (uj+St) channel disjoint paths, and kd, e = 0, otherwise. 
It follows that (_Ff)j„/ (t ) = I u+ s t , an (oj + S t ) x (oj + 5 t ) 
identity matrix, that is, det((Ff) In i^) = 1, which means 
that det((-F t p )/„'( t )) is a nonzero polynomial. 

Next, we show that the degree of each indeterminate kd, e in 
nonzero polynomial det((i^ p )/„/( t )) is 1 at most. Notice that 



det( 



A P B 
I -K 



A In'{t) 



\i-k o £x(w+5t) 

=(-1)* det((J - K)) ■ det(A p B(I - K^A^^) 

=det((F/') Jn , (t) ).(-ir, 

where det((I — K)) = 1 because K is an upper triangular 
matrix and all elements of main diagonal are zeros. This 
implies that the degree of each indeterminate kd, e in the 
polynomial det((-F ,p )/ n /( t )) is 1 at most. Further, applying 
it to every non-source node t G V with Ct > uj and every 
p G Rt(S t ), it follows that 

[] II det((F t 'W (i) ) 

t£V:C t >w peR t (S t ) 

is also a nonzero polynomial over the base field T, and the de- 
gree of each indeterminate k d , e is at most 2~2tev-c t >u ^ f 
Together with Lemma |6] this proves that if 



1^1 > 



E 1^(^)1. 



teV:C t >u> 

we can choose scalar values in T for all indeterminates such 
that 

J] [] det((F p ) 7rl , (t) )^0, 

teV:C t >u pGRt(St) 

which means all determinants are nonzero, that is, 
R&nk((F t p ) In , (t) ) =uj + S t for all t G T and p G R t (S t ). 

As (Ft ) In i(t) is a submatrix of Ff with the same number 
of rows which is equal to the rank of (Ff) In i^ t ), it is shown 
that Rank(F p ) = oj + St. In addition, wherever p -< t 77, 
Rank(i 7, ( p ) < Rank(F/ 7 ), together with Lemma|7] implies that 
d^ in (G) > St + l for all non-source nodes t G V with Ct > uj. 
The proof is completed by combining what we have proved 
and weakly extended Singleton bound. ■ 

Moreover, the existence of multicast MDS codes leads to 
the existence of broadcast/dispersion MDS codes. 

Theorem 8: There exist cj-dimensional linear network error 
correction broadcast/dispersion MDS codes on a network G, 



if the size of the base field satisfies respectively: 

|^| > ]T \R t (3 t )\ + \V 2 \, 
tev-.c t >u 

m> E El+ET f Et&TCt ) + \v,i 

T£T:C T >u> V T / 

where V% C V is the set of all non-source nodes t E V 
satisfying Ct < uj, T is the set of all collections of non-source 
nodes, and V3 is the set of all collections T G T satisfying 
Ct < uj. 

We give the following corollary with the looser lower bound 
on the size of the base field T. 

Corollary 9: There exists an w-dimensional linear network 
error correction multicast/broadcast MDS code on a network 
G, if the size of the base field satisfies respectively: 

m> £ ('*'), -w> £ C^ + n. 

IV. Construction of the Optimal Codes 

Actually, if we can design an algorithm for constructing 
multicast MDS codes, then it is not difficult to obtain algo- 
rithms for constructing broadcast/dispersion MDS codes. In 
the following, by modifying the algorithm for constructing 
network MDS codes proposed by Guang et al. IfTTl Algorithm 
1], we give an algorithm below for constructing multicast MDS 
codes. First, we introduce some notation. For arbitrary subset 
B C In{s) U E U E', define C(B) = ({/ e : e G B}), 
&(B) = ({h ■■ e e B}), £»(B) = ({fg : e £ B}}, and 
£ p \B) = ({ff :etB}). 

Remark 10: Similarly, we also can extend another im- 
portant linear network code — generic code to network error 
correction. Because of the limit of pages, we omit that part. Cai 
ifPTl proposed strongly generic linear network codes and very 
briefly discussed its application for network error correction. 
But this strongly generic condition is too strong for this 
application. Actually, the better conclusions can be obtained. 
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Algorithm 1: Constructing a multicast MDS code. 
Input: The network G = (V, E), and the information 
rate to. 

Output: Extended global kernels (forming a multicast 
MDS code). 

Initialization: For each non-source node t G V with 
Ct > uj and each p G R t (5 t ), choose (ui + S t ) channel 
disjoint paths from In(s) or p' to t satisfying Lemma [5] 
Denote by Vt, P the set of the above (lu + 5t) channel 
disjoint paths, and E t . p denotes the set of all channels on 
paths in Vt.p- Initialize dynamic channel set 
CXJTt.p = In(s) Up' = {d{, ■■■ ,d'JU{e' : eS p}, as 
well as the extended global encoding kernels f e = l e for 
all imaginary channels e G In(s) U E'. 
foreach node i G V (according to the ancestral order) do 
foreach channel e G Out(i) do 

if e f UteV:C t >u: ^ P eR t (6 t ) E t , p , then 
I f e = l e , all CUT t , p remain unchanged. 

else 

choose g e G £(In(i) U {e'})\ U tev : 
U P zB t(St >.[£ p (CUT tiP \{e(t,p)}f+" 

C p (In(i) U {e'})], where e(t,p) represents 
the previous channel of e on the path which e 
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